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1. Introduction 

The evolution of modern scientific thought is strewn with several examples ex- 
pressing the following sentiment: in any effort to accomplish a task there can be a 
certain limit to the efficiency of its performance. In the present context we bring 
to attention three such famous examples which are based on the combination of a 
deep conceptual approach and simple mathematical arguments. Finally, we shall 
focus on one of them, namely, limits to the efficiency of estimating an unknown 
parameter involved in a family of states of a finite level quantum system. 

Our first example is the celebrated uncertainty principle of Heisenberg [8] in 
quantum mechanics. For an interesting historical account of this great discovery 
in the philosophy of science we refer the reader to the essay by Jagdish Mehra 
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in [13]. If q and p denote the position and momentum operators of a quantum 
mechanical particle executing motion on the real line M they obey the commutation 
relation qp — pq = ih where h = h/2n, h being the Planck's constant, and this 
implies the following inequality. If if> is the absolutely square integrable wave 
function describing the state of the system and Var(Jf |-0) denotes the variance of 
the observable in the state if> then 

Var(g|^)Var(p|^) ^ h 2 /A. (1.1) 

In particular, if the variance of p in the state if) is a 2 then 

Var(g|^) ^ h 2 /Aa 2 . 

In other words, this sets a limit to the accuracy with which the position q can be 
measured in the state if). Such limits to accuracy hold for any 'conjugate pair' of 
observables in quantum theory. 

Our second example is the famous Cramer- Rao inequality [I] , [IB] in the theory 
of estimation of statistical parameters. For an amusing and insightful account of 
the route by which this fundamental discovery was made and how it came to be 
recognized in the history of statistical science we refer to [T9J. Suppose {p(u,9)} 
is a parametric family of probability density functions with respect to a cr-finite 
measure in a Borel space (f2,jF), 9 being a real parameter varying in an open 
interval (a, b). Assume that the function 

m = L logp( ^ e) ) ^ e)fi{duj) (L2) 

is well-defined for all 9 in (a, b). On the basis of a sample point u obtained from ex- 
periment evaluate a function T(uo) as an estimate of the parameter 9. The function 
T(-) on Q is called an unbiased estimator of 9 if 

I T(u)p{u,e)n(du) = 9 V 9e(a,b) 
Jq 

and, in such a case, its variance, denoted by V{T\9) is defined by 

V(T\9) = J(T(lu)-9) 2 p(u,9)fi(du). 

Indeed, V(T\9) is a measure of the error involved in estimating 9 by T(u). The 
Cramer-Rao inequality in its simplest form says that 

V(T\9) > 1(B)- 1 (1.3) 

where 1(9) is given by (II. 2p and called the 'Fisher information' at 9. Thus fll.3p sets 
a limit to the accuracy of estimating the unknown parameter 9 from experimental 
observation. 

It is a remarkable fact that a special case of (11.31) implies the Heisenberg un- 
certainty principle (11.11) and much more in emphasizing the profundity of Fisher 
information. Indeed, let ip G L 2 (M) be a wave function so that H^H = 1- By 
changing if} to a new wave function e iax ip(x), a 6 M, if necessary, we may assume, 
without loss of generality that the momentum operator p satisfies the condition 
(ip\p\ip) = and (ip\q\ip) = m, a real scalar. By Born's interpretation / = \ip\ 2 is 



K. R. Parthasarathy 



3 



the probability density function of the position observable q in the state ib. Intro- 
ducing the parametric family {f(x — 9), 9 G M} of probability densities we see that 
its Fisher information 1(9) is given by 

2 



1(9) 



f(x- 



f(x — 6) dx 

2 



f(x-9) 

= 4 J \ Re ^(x)j \ib(x)\ 2 dx 
and therefore independent of 9. By Cramer-Rao inequality we have 

Var(q\if>) = J (x — m) 2 f(x) dx 
1 

On the other hand 



:i.4) 



I(m) 



;i.5) 



Var(p\if)) = x 2 \(Fib)(x)\ 2 dx 
Jr 

where F is the unitary Fourier transform in L 2 (IR). Thus by (11.41) we have 
Var(p|^) = (ib\F^q 2 F\ib) 

|2 
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\ip'(x)\ 2 dx 



if) 



\ip(x)\ 2 dx 
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\ib(x)\ 2 dx 



I(m) 



;i.6) 



which together with (II. 5ft implies (11.11) . The more powerful inequality (11. 6B and 
its natural generalization for covariance matrices in L 2 (IR n ) are known together as 
Stam's uncertainty principle. For more information along these lines and a rich 
survey of information inequalities we refer to the paper [5] by A. Dembo, T. M. 
Cover and J. A. Thomas. 

Our last illustrious example is of a different genre but again connected with the 
notion of information. It is Shannon's noisy coding theorem [20] which sets a limit 
to the ability of communication through an information channel in the presence 
of noise. Again we present the simplest version of this strikingly beautiful result 
in order to highlight the philosophical aspect and refer to p2] for more general 
versions. Consider an information channel whose input and output alphabets are 
same and equal to the binary alphabet {0, 1} which is also a field of two elements 
with the operations of addition and multiplication modulo 2. If an input letter 
x from this alphabet is transmitted through the channel assume that the output 
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letter is x or x + 1 with probability 1 — p or p so that the probability of error due 
to noise in transmission is p. Such a channel is said to be binary and symmetric. 
Assume that the transmission of a sequence xi, x 2 , ■ ■ ■ , x n of n letters through this 
binary symmetric channel yields the output sequence y 1; y 2 , ■ ■ ■ , y n where y x — x 1; 
y 2 — x 2 , ■ ■ ■ , y n — x n are independently and identically distributed Bernoulli random 
variables, each assuming the values and 1 with probability q = 1 — p and p 
respectively. Such a channel is called a memoryless binary symmetric channel. 

Denote the alphabet by F2. By a code of size m, length n and error probability 
not exceeding e, where < e < 1, we mean m pairs (iij, Ej), 1 ^ j ^ m, Uj G F 2 , 
Ej C Fg, Ei, E 2 , . . . , E m are pairwise disjoint, satisfying the inequalities 

P (output sequence G Ej \ input sequence = Uj) > 1 — e V j. 

Denote by N(n,p,e) the maximum possible size for codes of length n with error 
probability not exceeding e. Then 

lim — log 2 N(n,p, e) = 1 + p log 2 p + q log 2 g V <e < 1, < p< 1/2. (1.7) 

n— >oo77, 

If we write = — p log 2 p — q log 2 g and call it the Shannon entropy of the 

Bernoulli random variable with probability of success (error) p then (11.71) has the 
interpretation that for large n, among the 2 n possible input sequences of length n 
roughly 2 n<yl ~ H<yP ^ sequences could be transmitted with error probability < e and 
not more. For this reason the expression on the right hand side of (11.71) is called 
the Shannon capacity of the binary symmetric channel with error probability p. 
A corresponding generalization for memoryless and stationary quantum channels 
describing their 'capacity' to transmit classical alphabetic messages exists. For a 
leisurely and self-contained exposition of such coding theorems see [TTj . The notion 
of entropy that arises in the brief discussion above can be introduced for a large class 
of density functions and this, in turn, leads to some remarkable connections with 
Fisher information and many powerful information theoretic inequalities. Once 
again we refer to the very rich survey article [3]. 

All the three examples described above have been generalized in several ways, 
connections between them and relations with other branches of science and en- 
gineering have emerged and an enormous amount of literature has grown around 
them. The last example has given birth to the subject of quantum information 
theory and coding theorems for quantum channels [H], [TTj. The present essay is 
devoted to the second example but in the context of parametric families of states 
of finite level quantum systems. Starting from the books of Helstrom [9], Holevo 
|1U] . and Hayashi [7j there is quite some literature on the Cramer Rao bounds for 
quantum systems. By confining ourselves to finite level systems we avoid the tech- 
nical difficulties of dealing with unbounded operators and their varying domains 
but we gain conceptual and algebraic clarity. 

In Section 2 we give a brief account of the quantum probability of finite level 
quantum systems in a complex finite dimensional Hilbert space including the no- 
tions of events, observables, states, generalized measurements and composite sys- 
tems in the language of tensor products of Hilbert spaces. Heisenberg's uncertainty 
principle and an entropic uncertainty principle are briefly described. The notions 
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of parametric families of states and unbiased estimators of parametric functions 
along with their variances and covariances are introduced. 

Section 3 contains the key notions, namely, Fisher maps, the Fisher information 
form and the Cramer- Rao-Bhattacharya (CRB) tensor with respect to a parametric 
family of states of a finite level quantum system. The Cramer-Rao-Bhattacharya 
(CRB) bound is finally expressed in terms of the CRB tensor and the Fisher 
information form. Several illustrative examples are given. 

In the last section we show how, by using a dilation theorem of Naimark, one can 
obtain a CRB bound for the covariance matrix of unbiased estimators of parametric 
functions based on generalized measurements. 

2. Preliminaries in the quantum probability and statistics of finite 

level systems 

A finite level quantum system is described by 'states' in a finite dimensional 
complex Hilbert space. We choose and fix such a Hilbert space Ti with scalar 
product (u\v) which is linear in the variable v and antilinear in u. A typical example 
obtains when Ti is the n-dimensional complex vector space C n of column vectors 
and its dual is the space of all row vectors. In this case the scalar product is 
expressed as 

n 

(u\v) = J^aA 
i=i 

where 





ai 




r bi i 




a 2 




b 2 


u = 




,v = 





Elements of Ti are called ket vectors, a typical element in Ti being denoted by \v) 
whereas any element in the dual of Ti is called a bra vector and a typical bra vector 
is denoted by (u\. The linear functional (u\ evaluated at a ket vector \v) is the 
scalar product (u\v). If A is an operator in Ti it is customary to write 

(u\Av) = (u\A\v). 

The adjoint of A is denoted by A^ so that 

(u\A\v) = (A ] u\v) = (u\Av). 

In such a notation \u)(v\ denotes the operator satisfying 

(\u)(v\) \w) — (v\w)\u) V \u), \v), \w) in Ti. 

The trace of an operator A in Ti is denoted by Tr A. In particular Tr \u)(v\ = (v\u). 
Note that \u)(v\ is a rank one operator when \u) ^ 0, |i>) ^ 0, and 

(|ui)(ui|) (\u 2 )(v 2 \) ■ ■ ■ (\u k )(v k \) = c|ui)(u fc | 

where c = (v^uz) (v 2 \u 3 ) ■ ■ ■ (v k -i\u k ). 

Denote by 13 (Ti), V(Ti), 0(H), S(Ti) respectively the *- algebra of all operators 
on Ti with its usual (strong) topology, the orthomodular lattice of all orthogonal 
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projection operators on Tt, the real linear space of all hermitian operators in Ji 
and the compact convex set of all nonnegative definite operators of unit trace. We 
have V(H) C 0(H) C B(H) and S(H) C 0(H) C B{H). If A,B G 0(H) we say 
that A $C B if B — A is nonnegative definite. Then 0(7i) is a partially ordered real 
linear space. A nonnegative definite hermitian operator is simply called a positive 
operator. 

The zero and identity operators are denoted respectively by O amd /. Often, 
/ is denoted by 1. For any scalar A the operator XI is also denoted by A. Thus, 
for A G B(H), A G C, A - A stands for the operator A - XI. For any E G V(H), 
^ E ^ 1 and (1 — E) G V(H). By a projection we shall always mean an 
orthogonal projection operator i.e., an element of V(H). If E Xl E 2 G V(TC) and 
Ei ^ E 2 then (E 2 — Ei) G V{K). When a quantum system is described by H 
we say that the elements of V(7i) are the events concerning the system, is the 
null event and 1 is the certain event. If E\ ) E 2 G V(TC) and E\ ^ E 2 we say that 
the event E\ implies the event E 2 . If E G V(7i) then 1 — is the event 'not 
If Ei,E 2 G V(TC) their maximum £i V and minimum E\ A £"2 are respectively 
interpreted as 'E 1 or £ 2 ' and l E x and E 2 '. If E X E 2 = then Ei\l E 2 = E 1 + E 2 . If £1 
and E 2 commute then EiAE 2 = E\E 2 . The first basic difference between classical 
probability and quantum probability theory arises from the fact that for three 
events E t in V(H),i = 1, 2, 3 one may not have ExA(E 2 VE 3 ) = (E 1 AE 2 )V(E 1 AE 3 ). 
Whenever the E^s commute with each other the operations A and V distribute with 
each other. 

Any hermitian opearator X in 7i is called a real-valued or simply an observable 
about the system described by 7i. Thus OiTi) is the real linear space of all real- 
valued observables. If X, Y G 0{7i) and XY = YX then XY is also an element 
of 0(7l). If X G 0(H) and o~(X) denotes the set of all its eigenvalues then, by the 
spectral theorem, X admits a unique spectral resolution or representation 

X= XE * (2- 1 ) 

where c(X) C 1R is a finite set of cardinality not exceeding the dimension of 7i, 
0^E x e V(H) V A G a(X) and 

= (2-2) 

E\E\/ = 5 XX ,E X V A,A'Gcr(A). (2.3) 

This, at once, suggests the interpretation that the eigenprojection E\ associated 
with the eigenvalue A in (12.11) is the event that the observable X takes the value A 
and o~(X) is the set of all values that X can take. If ip : c(X) — > R or C is a real 
or complex- valued function then 

<p(X)= J2 ( 2 - 4 ) 

Aecr(X) 

is the real or complex-valued observable which is the function ip of X. 
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Any element p G S(TC) is called a state of the quantum system described by 
X. Such a state p is also called a density operator. Clearly, p itself becomes an 
observable. If E e V{7i) is an event and p is a state then Tt pE is a quantity in 
the unit interval [0, 1] called the probability of the event E in the state p. If E 2 
are two events satisfying the relation E\E 2 = then E% + £2 is also an event 
and Tr p{E\ + E 2 ) = Tr pE\ + Tr pE 2 . However, for two events, E\,E 2 it is not 
necessary that Tr p(Ei V E 2 ) ^ Tr pE\ + Tr pE 2 . In short, subadditivity property 
for probability need not hold good. But this property is retained whenever E\ and 
E 2 commute with each other. 

If p is a state and X is an element of 0{H) with spectral resolution (12.1 j) then 
Tr pi?A is the probability that X takes the value A in the state p whenever A e cr(X). 
Thus the expectation of X in the state p is given by 

^ A Tr pE\ = Tr p A£ A = TrpX. 

AGcr(X) Aecr(X) 

More generally, the expectation of <p(X) defined by (12 .4p is given by Trp<p(X). In 
particular, the variance of X in the state p, denoted by Var(X|p) is given by 

Var(X|p) = TrpX 2 - (TrpX) 2 
= Tr p(X — m) 2 

where m = Tr pX is the expectation or mean of X in the state p. This shows, in 
particular, that Var(X|p) vanishes if and only if the restriction of X to the range 
of p is a scalar multiple of the identity. 

The spectral theorem implies that the extreme points of the convex set S{7i) 
are one dimensional projections of the form where is a unit vector in 

7i. Here, the projection remains unaltered if \ip) is replaced by c\i/j) where c is a 
scalar of modulus unity. Extreme points of S(TC) are called pure states and a pure 
state is a one dimensional projection which, in turn, is determined by a unit vector 
in 7i modulo a scalar of modulus unity. By abuse of language any determining 
unit vector itself is called a pure state. Thus whenever we say that a unit vector 
is a pure state we mean the density operator \ip)(ip\. By spectral theorem any 

state p can be expressed as ^PjlV'i)^! where p l5 p 2 , ... is a finite probability 

3 

distribution and {\if)j),j = 1, 2, . . .} is an orthonormal set of vectors in Ti. If {1^}} 
is any set of unit vectors and pj,j = 1,2,... is a probability distribution then 
J2Pj\4'j)( , ipj\ is a state. If \tp) is a pure state and X is a real-valued observable 

3 

then its variance Var(X||'0)) in the pure state is zero if and only if is an 
eigenvector for X. Thus, even in a pure state an observable need not have a 
degenerate distribution. This is a significant departure from classical probability. 

Hereafter, unless otherwise explicitly mentioned, we shall mean by an observable 
a real- valued observable. Let X, Y be two observables, p a state and let m = Tr pX, 
m! = Tr pY their respective means. Put X = X — m, Y = Y — m! and consider 
the nonnegative function 

f(z) =T±p{X + zY)\X + zY), zeC. 
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Then the inequality 
implies (see [BJ, [16J) 



inf/(z)>0 



Var(X|p)Var(F|p) ^ |Trp^(XF-FX)| + jlr p ^(XY - YX) j (2.5) 

and thus puts a lower bound on the product of the variances of X and Y in a 
state p. The quantum probability of finite level systems we have described here has 
a natural generalization when 7i is an infinite dimensional Hilbert space. When 
TC = L 2 (M), X = q, Y = p are the well-known position and momentum operators 
satisfying the Heisenberg commutation relations qp — pq = iH the inequality (12.51) 
yields the special form 

Var(?||V>))Var(p||<0)) > -j V \^p) eV 

where D is a dense domain in 7i where unbounded operators like qp,pq etc. are 
well-defined. Thus (12. 5ft is at the heart of the Heisenberg's principle of uncertainty. 

Now we introduce a notion which is more general than that of an observable. 
Indeed, it plays an important role in the quantum version of Shannon's coding 
theorems of classical information theory. 

Definition 2.1. A generalized measurement C of a finite level quantum system 
with Hilbert space H is a pair (S, L) where S is a finite set and L : S — > B(TC) is 
a map satisfying the condition: 

Y^L( S yL(s)=I. (2.6) 

Such a generalized measurement £ = (S, L) has the following interpretation. If 
the system is in the state p and the measurement L is performed then the 'value' 
s G S is obtained with probability Tr L(s)pL(sY and the system 'collapses' to a 
new state 

L(s)pL( S y 

TrL(s)pL(s)f 1 ' 

If, for example, the system is initially in the state p, a generalized measurement 
£i = (Si, Li) is performed and followed by another generalized measurement £2 = 
(S 2 ,L 2 ) then the probability of obtaining the value Si G Si is Tr L 1 (si)pL 1 (s 1 )t 
and the conditional probability of getting the value s 2 G S 2 from £ 2 given the value 
Si from Ci is 

T T ( \ \ L l(Sl)pLl(SiY \ f 

1Vi2(S2) lTVL 1 (. Sl )pi 1 ( Sl )tr 2(S2) ' 
Thus the probability of obtaining the value (si, s 2 ) from Ci followed by C 2 is equal 
to 

p(si, s 2 ) = Tr L 2 (s 2 )Li(si) p Li(siY L 2 (s 2 ) j 
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and the final collapsed state is 

L 2 (s 2 )L 1 (s 1 )pL 1 (s i yL 2 (s 2 y 
P(si,s 2 ) 

More generally, if the measurements Ci = (Si, Li), i = 1,2, ... ,m are performed in 
succession on a quantum system with initial state p then the probability 
p(si, s 2 , . . . , s m ) of getting the sequence si, s 2 , . . . ,s m of values Sj G Sj V j is given 
by 

p(si, s 2 , . . . , s m ) = TrL m (s m )L m _i(s m _i) • • • Li(si)pLi(si) t L 2 (s 2 ) t • • • L m (s n 
and the final collapsed state is 

) . . .L 1 (s 1 )pLi(si) t L 2 (s 2 ) t - • • 



p(si, s 2 ,...,s r 

This at once suggests the product rule for measurements Ci = (Si, Li) i = 1, 2 as 
C = (Si x S 2 ,L) where 

L(si, s 2 ) = L 2 (s 2 )Li(si), si e Si, s 2 e S 2 . 
The measurement C stands for the measurement Ci followed by the measurement 

c 2 . 

If C = (S, L) is a measurement with S C M or C then its expectation in the state 
p is given by 

J2sTrL(s)pL( S y = sTrpL( S yL(s). 

ses ses 

If S C K its variance Var(£|p) in the state p is given by 

J2s 2 TrpL( S yL(s) - (j2 sTrpL( S yL(s)] . 

ses \ses J 

When L(s) is a projection for every s G S then (S, L) is called a projective or 
von Neumann measurement. If, in addition, ScK then the hermitian operator 
s L(s) is an observable and our notion of generalized measurement reduces to 

seS 

measuring an observable. It may be of some interest to formulate and obtain an 
uncertainty principle for a pair of two real-valued measurements. 

For a measurement with values in an abstract set S it is natural to replace the 
notion of variance by its entropy in a state p. Thus we consider the quantity 

H(C\p) = -Y,P(s)\og 2 p(s) (2.8) 

ses 

where 

p(s) = Tr pL(s) ] L(s) 

and call it the entropy of the measurement C = (S, L) in the state p. With this 
definition one has the following entropic uncertainty principle. 
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Theorem 2.2 ([IT]. [12]). Let C = (S,L), M = (T, M) be two generalized mea- 
surements of a finite level quantum system in a Hilbert space 7i. Let L(sYL(s) = 
X(s), M(ty M(t) = Y(t), s E S, t E T. Then for any state p the following holds: 

H(C\p) + H(M\p) ^ -21og 2 m&x\\X(s) 1/2 Y(t) 1/2 \\. (2.9) 

s,t 

Remark It is important to note that the right hand side in the inequality (12. 9p 
is independent of p. 

If Xi, 1 ^ % ^ k are k observables, p is a state in Ji and Tr pXi = m{ define the 
scalar 

Uij = - Trp {(Xi - m i )(X j - mj) + (Xj - m j )(X i - m*)} . (2.10) 

Then the real symmetric matrix ((vif)) of order k is called the covariance matrix of 
the observables X±, X 2 , . . . , X^ in the state p and denoted by Cov (X 1; X 2 , . . . , X^ \pj 
It is a positive semidefinite matrix and it is important to note the symmetrization 
in i,j in the right hand side of (12.101) . Without such a symmetrization could 
be a complex scalar. 

Till now we talked about a single quantum system. Suppose we have a com- 
posite quantum system made out of several simple systems Ai,A 2 , . . . , with 
respective Hilbert spaces TCa 1 , / Ha 2 i ■ ■ ■ ^A k - Then the Hilbert space of the joint 
system A\A 2 . . . is the tensor product 

n Al ...A k = h a ,®h m ®---® n Ak . 

This is the quantum probabilistic analogue of cartesian product of sample spaces 
in classical probability. It is clear that 

k 

dim U Al ...A k = JJdim H Ai , 
i=i 

dim indicating dimension. If pi is a state in H Ai Vz then p x <S> ■ ■ • <2) Pk is a state 
of the composite system A\A 2 . . . Af. called the product state. If p is a state in 
r HA 1 ...A k and we take its relative trace over 7i.A il ,7~LA i2 , ■ ■ ■ ^A ie then we get the 
marginal state of the system A ri , A r2 . . . , A r . m where {1, 2, ... , k} is the disjoint 
union i 2 , . . . , ie} U {r 1 , r 2 , . . . , r m } with £ + m = k. In this context of composite 
quantum systems there arises a new distinguishing feature of the subject with a 
remarkable role in physics as well as information theory. It is the existence of a 
very rich class of states in TLA 1 A 2 ...A k which do not belong to the convex hull of all 
product states. Such states are called entangled states and they constitute a rich 
resource in quantum communication [T3] . 

Till now we restricted ourselves to quantum probability. Now we describe a few 
basic concepts in quantum statistics dealing with a parametric family of quantum 
states of a finite level system. Let T be a parameter space and let {p(6),6 e T} 
be a parametric family of states in a Hilbert space H. Suppose X is an observable, 
i.e., an element of 0{Tt) and 

Tr p(9)X = f(6), 6 ET, (2.11) 
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where / is a real-valued function on T. then we say that the observable X is an 
unbiased estimator of the parametric function f on T. 

When the parametric family {p{0), 9 G T} is fixed we write 

Var(X|#) = Var(X|p(0)) (2.12) 

If Xi, X 2 , . . . , X m are m observables we write 

CovpTx, . . . , X m |0) = Cov(X 1; . . . , X m \p{9)). (2.13) 

A real- valued function / on V is said to be estimable with respect to {p{9), 9 G T} 
if there exists an observable X G 0(7i) such that 

Trp(fl)X = /(0) V G T. 

Denote by £(r) the real linear space of all such estimable functions. An observable 
X is said to be balanced with respect to the family {p{9),9 G T} if Tr p{9)X = 
V 9 G T. Denote by M the real linear space of all such balanced observables. For 
any / G S(T), an unbiased estimator X of / write 

v f (9) = inf {Var(A + Z\9), Z G M} . 

It is natural to look for good lower bounds for the function Vf{9). We shall examine 
this problem in the next section and study some examples. If fj, 1 ^ j ^ m 
are estimable parametric functions we shall also look for matrix lower bounds for 
the positive semidefinite matrices Cov(Xi, . . . , X m \9) as each Xi varies over all 
unbiased estimators of fa for each i = 1,2, ... ,m. 

For a more detailed introduction to quantum probability theory we refer to [15J, 
[To] . For an initiation to estimation theory and testing hypotheses in quantum 
statistics we refer to [7j, 0, PH], References [7j, [TU], [E], [TTJ contain applications 
of the theory of generalized measurements. 

3. The Fisher information form and the Cramer- Rao- Bhattacharya 

tensor 

We consider a fixed parametric family {p(9),9 G T} of states of a finite level 
quantum system in a Hilbert space TC with parameter space T. As mentioned in 
the preceding section denote by S(T) and Af respectively the real linear spaces of 
estimable functions and balanced observables. Recall that for any two unbiased 
estimators X and Y of an element / G S(T), the observable X — Y is an element 
of J\f. 

Definition 3.1. A map F : V — > B{7i) is called a Fisher map for the family 
{p(9), 9 G T} of states in TL if the following two conditions hold: 

(i) Tr p(9)F(9) = V 9 G T, 

(ii) Tr p(9) {F(9)^X + XF(9) } = V 9 eT,X eAf. 

Denote by T the real linear space of all Fisher maps with respect to {p(9), 9 G T} 
and by *A(r) the algebra of all real-valued functions on r. If a G *A(r) and F G T 
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then aF defined by (aF)(9) = a(8)F(6) is also in T. In other words T is an 
^4(r)-module. For any two Fisher maps F, G in T define 

I(F,G)(9) = TrpiO) 1 - (F (9)^0(9) + F(9)) 
= ReTr p{9)F{9)iG{9). 

Then I is called the Fisher information form associated with {p{9), 9 G T}. It may 
be noted that, for all F, F\, F 2 , G G T and a G A(T), the following hold: 

1(F,G)=1(G,F), 
I(aF,G) = aI(F,G), 
I(F 1 + F 2 , G) = I(F U G) + I(F 2 , G), 
I(F, F) ^ 0. 

In particular, for any Fi, 1 ^ i ^ n in JF the matrix 

X n (F 1 , F 2 , . . . , F n \9) = {{Z{F h Fj){9))) , 9 G V, i, j G {1, 2, . . . , n} (3.2) 

is positive semidefinite. It is called the information matrix at 9 corresponding to 
the elements Fi, 1 ^ i ^ n in T . 
If / G S(T), F G define 

\(f,F)(9) = Trp(9)±(F(9^X + XF(9)), 9eT (3.3) 

where X is any unbiased estimator of /. Note that, in view of property (ii) in 
Definition 13.11 the right hand side of ( 13.31) is independent of the choice of the 
unbiased estimator of /. Clearly, A(/, F) is real linear in the variable / when F is 
fixed and ^4(r)-linear in the variable F when / is fixed. Thus A(-, •) can be viewed 
as an element of E{T) <S> T. We call A(-, ■) the Cramer-Rao-Bhattacharya tensor or 
simply the CRB-tensor associated with {p(9),9 G T}. 

Let f] G £(r), Xi an unbiased estimator of fi for each 1 ^ i ^ m and let Fj, 
1 ^ J ' ^ n be Fisher maps with respect to {p(9), 9 G T}. Define the m x m matrix 

A mn (9) = ((Ay(0))), l^i^m, Uj<n, #GT (3.4) 
A^(0) = \(fi,Fj)(9) 9eT, (3.5) 

A being the CRB-tensor. We now introduce the family of positive semidefinite 
sesquilinear forms indexed by 9 G T in the vector space B(H) by 

B (X,Y) = TrX^p(9)Y, X,YeB(H). (3.6) 

By property (i) in Definition I3.1[ equations (13.31) and (13. 5p we have 

X l3 {9) = Tr p(9)^{F j (9)\X i -f ,(9)) + (X i -f i (9))F j (9)} 
= ReB e (X i -f i (9),F j (9y). 
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Multiplying both sides by the real scalars a^bj and adding over 1 ^ i ^ m, 1 ^ i ^ 
n, we obtain 

tm n \ 

i=l j=l J 

where A mn and Bg are given by (13. 4p . (13. 5p and (13. 6j) and a, b are respectively col- 
umn vectors of length m, n with prime ' indicating transpose. Now an application 
of the Cauchy-Schwarz inequality to the right hand side of (13. 7p implies 

/ n m N 

(a'A mn (0)b) 2 < Bel^OiiXi-fiieV^aiiXi-fiie)) 



,i=i i=i 

n n 



Vi=i j=i 

= {a'Cov (X 1; X 2 , . . . , X m |0) a} {b'J n (F 1; F 2 , . . . , F n |0) b} . 

Dividing both sides of this inequality by h'I n (-Fi, F 2 , . . . , F n \9) b, fixing a and 
maximizing the left hand side over all b satisfying I n (F ly F 2 , . . . , F n \6) b 7^ we 
obtain the matrix inequality: 

A mn (e)i-(F 1 , f 2 ,..., F n \e)A mn (ey < covpr ls x 2 , . . . x m \e), 

X~ denoting the generalized inverse of T n {F x , F 2 , . . . , F n \6). In other words we have 
proved the following theorem 

Theorem 3.1 (Quantum Cramer- Rao-Bhattacharya (CRB) inequality). Let 

{p(6),6 G T} be a parametric family of states of a finite level quantum system 
in a Hilbert space Ti, fi, 1 ^ i ^ m estimable functions on V, Xi an unbiased 
estimator of fi for each i and let Fj, 1 ^ j ^ n be Fisher maps with respect to 
{p(6), 6 ET}. Then the following matrix inequality holds: 

Cov (X U X 2 , ...,X m \B)> A mn (6)l- F 2 , . . . , F n \6) A mn {6)' V^eT 

where A mn (6) is the m x n matrix defined by (13.31) - (13. 5p and T~[F\, F 2 , . . . , F n \9) 
is the generalized inverse of the Fisher information matrix I n (Fx, F 2 , . . . , F n \9) 
associated with Fi, F 2 , . . . , F n . 



Proof. Immediate. 

Corollary 3.1. Let Xi, 1 ^ i ^ in, Fj, 1 ^ j ^ n be as in Theorem \3.1\ Then 
A mn (6)T~(Fi, F 2 ,..., F n \6)A mn (9)' 

^ A mn _ 1 (^)z-_ 1 (F 1 , f 2 , . . . , F n ^\e)A mn ^{9)' , e e r 

for n ^ 2. 
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Proof. This is immediate from the fact that both the sides of the inequality above 
are arrived at by taking supremum over certain sets in M. n , the set for the left hand 
side being larger than the set for the right hand side. ■ 

We call the right hand side of the inequality in Theorem 13.11 the CRB lower 
bound. 



Remark 1 Theorem 13.11 and Corollary 13.11 imply the possibility of improving 
the CRB lower bound by searching for a larger class of ^.(r)-linearly independent 
Fisher maps for a parametric family of states. 

Remark 2 The CRB lower bound in Theorem 13. II has some natural invariance 
properties. If fi, 1 ^ i ^ m are fixed and X i: Fi(8),p(9) are changed respectively 
to UXiW, UFi(9)W, Up(9)U ] by a fixed unitary operator U in H then the CRB 
lower bound in Theorem 13.11 remains the same. 
If the Fisher maps Fj are replaced by 

n 

G j (0) = Y d a jr {6)F r {6\ l^j^n (3.8) 

r=l 

where the matrix A(9) = ((a rs (6))) is invertible for all 9 then 

A mn (9)l~ (Fi, . . . , F n \9)A mn (9)' = A mn (9)l~(Gi, . . . , G n \9)A mn (9)\ 

the tilde over A mn on the right hand side indicating that GVs are used in place 
of Fi's. In other words the CRB bound is invariant under ^4(r)-linear invertible 
transformations of the form (13. 8ft . 

Example 3.1 Let T = (a,b), H = C n and let 

p(9)=dia,g( Pl (9),p 2 (9),..., Pn (9)), 9eT 

be states in C n with respect to the standard orthonormal basis, diag denoting 
diagonal matrix. An estimable function / on V has the form 



f (9) = ^2^(9) 



i=l 



where (Xi £1X6 real scalars. An unbiased estimator X for / is 

X = diag(ai,a 2 , . . . ,a„). 
Note that Pi(9) ^ and ^2Pi(9) = 1 V 9 6 V. Assume that Pi(9) are differentiable 

in 9 and Pi (9) > V i, 9. Then 



F(9) = diag 
yields a Fisher map with 
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and 

n 

A(/,F) = ]£ 0^(0) = /'(*). 

i=i 

Theorem 13.11 for the single observable X and single Fisher map yields 



1 1 



Var (Y\6) > v=1 n ' V 5 £ (a, 6) 

i=i 

and any unbiased estimator Y of /. This is the Cramer-Rao inequality for finite 
sample spaces in classical mathematical statistics. 

Example 3.2(Quantum version of Barankin's example pQ, [21]). Let p(9) be an 
invertible density operator for every 9 in V. For any 7 £ V define 

F 7 (0) = p(7)pW- 1 - 1. 

Then F 7 is a Fisher map and for any estimable function / £ S(T) we have 

A(/,F 7 )(0) = /( 7 ) -/(*). 

The Fisher information form X satisfies 

J(F 7l ,F 72 )(£) = ReTrp(7i)pW- 1 p(72) - 1 

If X is an unbiased estimate of / £ £(r) one obtains as a special case the CRB 
bound 

Var(X|#) ^ (/( 7l ) - f(9), f{ l2 ) - f(9), . . . , /( 7n ) - /((9)) 
J-( 7l , 72, ••• , 7n, 5) (/(71) - /(C), • • • , /( 7 n) - /(^))' 
where X~(7i, 72, ... , 7 n e) is the generalized inverse of the information matrix 

((ReTrp( 74 )p(e)- 1 p(7 J )-l)) 

for any 71,72, . . . ,J n £ T. 

Example 3.3(Quantum Bhattacharya bound [2]). Let r C M. d be a connected open 
set and let p(0), £ T be a family of invertible states such that the correspondence 
6 — > p(0) is C m -smooth. then every estimable function / is also C m -smooth. For 
any linear differential operator D on T with C m -coeflicients satisfying D 1 = 
define 

F D (0) = ( J Dp)(0)p(0)- 1 

where D is applied to every matrix entry of p(-) on the right hand side in some 
fixed orthonormal basis. Then Fd is a Fisher map and the CRB tensor A satisfies 

\(f,F D )(0) = (Df)(6) V/££(r). 

If D 1 , D 2 are two linear differential operators in T with C m -coefficients annihilating 
the constant function 1 the Fisher information satisfies 

X(F Dl ,F D2 )(0) = Re Tr (D lP )(0) p(e)-\D 2 p)(0), 0eT. 
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If X is an unbiased estimate of / and D i} 1 ^ % ^ n are C m -differential operators 
on T then the CRB inequality has the form 

Var(A|£) ^ (DJ, D n f){0)l-{D x , D 2 , . . . , D^DJ, D n f)(0)> 

where X~(D\, D 2 , . . . , D n \6) is the generalized inverse of the positive semidefinite 
matrix 

((Re Tr {D iP ){9)p{0)-\D jP ){e))) , i,j G {1, 2, . . . ,n}. 

Example 3.4 Example 3.2 leads us to the following natural abstraction. Suppose 
T is a rf-dimensional C m -manifold and 6 — > p(0) is a C m -smooth parametrization 
of states in 7i as 6 varies in V . If L is a smooth vector field on V then 

F L (6) = (Lp)(0)p(0)-\ eev 

is a C m -smooth Fisher map with respect to {p(6),6 G T} under the assumption 
that exists for every 9. C m -smooth Fisher maps constitute a C m (r)-module 

and S(T) C C m (r). The CRB tensor A and the Fisher information form X satisfy 
the relations 

\(f,F L )(6) = (Lf)(6) 
1(F L ,F M )(9) = ReTi{Lp){e)p{e)-\Mp){6) 

for any two vector fields L, M. As a special case of the CRB inequality we have for 
any unbiased estimator X of / G £(r), 

for any C m -smooth vector field L. 

As a special case of the example above, consider a connected Lie group T with 
Lie algebra Q. Let 

p{g) = U g p U}, geT 

where po is a fixed invertible state. Any element L of Q is looked upon as a left 
invariant vector field on T. let U eX p t l = exp t vr(L), t G R, L G Q where L — > n(L) 
is a representation of in 7i. Then the CRB inequality yields 

Var(A|#) ^ d L f)(9)) 2 y L ^ g (3.9) 

Tr p [tt(L),po] 

where A is an unbiased estimator of /. If Lj, 1 ^ i ^ d is a basis for and the 
nonnegative definite matrix Xd is defined by 

J d = ((Re Tr p^ 1 [ir(Li),po] [vfe), p ])) , i,j G {1,2, 

then a maximization over all elements L in on the right hand side of ( 13.91) yields 

Var(A|s) ^ (Li f,L 2 f,..., L d f)(g)l d (LJ, L 2 f, . . . , £„/)(</)', 

being the generalized inverse of 

Example 3. 5 (adapted from [3j). We now consider an example in which the dif- 
ferent states piff) may fail to have an inverse, indeed, their ranges need not be the 
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same. Let r C M. d be an open domain and let p(0), G T obey the set of linear 
partial differential equations of the form 

^- = \(L j (e)p(d) + P (d)L j (e^), l^j^d (3.10) 

where the operators Lj(0) G B(TC). Taking trace on both sides we see that 

Re Tr p(9)Lj(0) = 0, 1 < j ^ d, eT. 

If 1m Tr p(0)Lj(0) = mj(0) we can replace in (EUOl) Lj(0) by Lj(0) - irrij{0) 
without altering the differential equations. Hence we may assume, without loss of 
generality, that in (I3.10P 

Tr p(0)Lj(0) = 0, l^j^d, 6 T. (3.11) 

We then say that the states p{0) which obey (13.101) and (13.111) constitute a Liapunov 
family. 

A special case of such a Liapunov family of states is obtained when d = 1 and 

p{B) =p(6)e^ L p e^ L \ 6eR 
where L is a fixed operator in 7i, po is a fixed state and 



Then 



p{9) = {Tr po e^ 9Lt e 




If po = \ipo)(4>o\ is a pure state then every p(Q) is a pure state. Thus rankp(#) = 
1 V 6 G 1R and we have a situation where {p{0)} admits a 'score operator function' 
with a classical part p'/p and a quantum part L. 

Going back to the Liapunov family satisfying (13.101) and (13. lip we observe that 
each of the maps — > Lj(0), 1 ^ j ^ d is a Fisher map. Indeed, if X is a balanced 
observable we have 

= ^r P (0)X) 

= ^Tr (L J (0)p(0)X + p(0)L J (0yx) 
= ^Trp(e)(L j (eyX + XL j (0)). 



For any estimable function / 



and the Fisher information form X satisfies 

l(L h L 3 )(0) = Re Tr p(6»)L i (6>)%(6»). 

If we write 

J d (0) = ((X(L i ,L i )(0))), i,je{l,2,...,d} 
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then the CRB inequality assumes the form 

Var(X|0) ^ (V/)(0)X d -(0)(V/)(0)' 

for any unbiased estimator X of /, V/ being the gradient vector ^||, Jj£, . . . , j . 

In the special case d = 1 introduced in the course of the discussion above the 
CRB bound assumes the form 

cm) 2 



Var(X|#) > 



If cr(0), 9 6 T are Liapunov families of states in Hilbert spaces 7i, K, 

respectively with coefficients Lj(0), Mj(0) in the respective differential equations 
corresponding to (13. 10|) then the tensor product states p(0)®a(0), G T constitute 
again a Liapunov family with the coefficients Lj(0) ® 1 + 1 <S> Mj(9), 1 ^ j ' ^ d in 
the differential equations corresponding to (13.101) and its Fisher information form 
satisfies 

1 (Li ® 1 + 1 ® % ® 1 + 1 ® Mj) (0) 
= X(L i ,L j )(0)+X(M i ,M j )(0). 



Eexample 3.6 Our last example is the case when p{6) is a mixture of the form 

N 
r=l 

where {p r (0), 1 ^ r ^ N} is a family of probability distributions on the finite set 
{1, 2, . . . , N} indexed by G T and for each fixed r, {p r (0), G T} is a Liapunov 
family of states obeying the differential equations 

^- = \ {L rj (0) p r (o) + Pr (0)L rj (0y} , i < j ^ d, g r 

and the conditions 

Trp r (0)L ri (0) = V 0GT. 

Let now 1 ^ i ^ m be estimable functions with respect to {p(0),0 G T} and 
let Xi be any unbiased estimator of fi for each z. Differentiating with respect to 9j 
the identity 

Trp(0)(X i -f i (9)) = O 

we get 

df N 

■j^ = Yl Vr{0) Re Tr M rj (0)p r (0) (Xi - /,(0)) (3.12) 



j r= i 



where 



M rj (0) = PriO)- 1 ^- + L rj (0). (3.13) 
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Multiplying both sides of (13.1 2p by real scalars a^bj and adding over i and j we get 

a' ((||)) b = E P^ TT \^ b i M ri( )j PrW (it^ ~ ■ 

Applying Cauchy-Schwarz inequality to each trace scalar product on the right 
hand side followed by the same inequality to the scalar product with respect to the 
probability distribution pi(6) , p 2 (6) , . . . ,Pn(6) we obtain 

a' ((!£)) ^ {f>(0)Tr (j2b jMrj (0) 

p r (0) ^X>,M ri (0)j |a / Cov(X 1 ,...,X m |6»)a (3.14) 

Let 

* r (0) = ((Re IV Pr {0)M ri {0yM rj {d))) , i,je{l,2,...,d}, 

N 

= £> r (0)tf P (0). 

r=l 

Then the validity of (13.141) for all a*, bj, 1 ^ i ^ m, 1 ^ j ^ d implies 
Cov(X u X 2 ,...,X m \0) > ((?£)) *-(6) ((0L 



d9jj J v ' \\d9j 
the super index - in \1/ indicating its generalized inverse. 

4. Estimators based on generalized measurements 

As in Section 3 we consider a parametric family {p(0), 9 G T} of states of a finite 
level quantum system in a Hilbert space 7i and a real- valued parametric function / 
on T. In order to estimate / we now look at a generalized measurement £ = (S, L) 
as described in Definition 12 . 1L Choose a real- valued function if on S and if the 
outcome of C is s then evaluate (p(s) and treat it as an estimate of f{9). We say 
that (C, <f) is an unbiased estimator of / if 

J2^)Trp(9)L(s^L(s) = f(e) V^r. (4.1) 

Indeed, it may be recalled from Section 2 that Tr p(9)L(s)^L(s) is the probability 
of the outcome s if the unknown parameter is 9. Then the variance of (£, ip) is 
given by 

Var(£, f\9) = J2 v(s) 2 Tr p{9)L{s)^ L{s) - f{9)\ (4.2) 

ses 

If we write 

X = <p(s)L( S yL(s) (4.3) 
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Then X is an observable and (14. ip shows that X is an unbiased estimator of / 
whenever (£, ip) is an unbiased estimator of /. However, Va.r(X\9) need not be the 
same as Var(C,<p\9). 

In OH]) put T(s) = L{ S yL{s),s G S. Then T{s) ^ and by Definition [2H 
T(s) = I. In other words {T(s), s G S} is a positive operator- valued distribution 

ses 

on 5 1 with total operator mass /. By a well-known theorem of Naimark [10], [IE] one 
can imbed the Hilbert space Ti isometrically in a larger Hilbert space Ti = Ti <8> JC 
with dim fC < oo and construct mutually orthogonal projection operators on Ti 
with the block operator form 



E(s) 



• T( S ) 


M(s) ' 


M(s) T 


iV( S ) 



s G 5 



(4.4) 



satisfying the following: 







s G 5, mGH 



spans 7i. 



Such a dilation of T(-) in W to £?(•) in W is unique upto a natural unitary isomor- 
phism. 

Now we go back to the unbiased estimator (£, ip) of / described in f)4.ip . Put 

m 

X 



' P(0) 


" 









E V(s) E(s). 
ses 

Then {p(#)> ^ G T} is a parametric family of states in Ti., X is an observable in Ti 
and equations (14. Xp and (14.41) imply that Tr f)(6)X = f(9). Furthermore 

Var(X|£) = Trp(6)(X-f(6)) 2 

ses 

= Vai(£,<p\9). 

Thus X is an unbiased estimator of / with respect to {p(9), 9 G T} with the same 
variance as the unbiased estimator (£, <p) based on generalized measurement for 
the original family of states. 

If F is a Fisher map for {p(9), 9 G T} then F defined by 



F{9) 



- F{9) 


" 









g r 



is a Fisher map for {p(9),9 G T} in Ti. If X is the Fisher information form for 
{p(9),9 G T} we have 

1(F 1 ,F 2 )(9)=1(F 1 ,F 2 )(9). 
Thus from Theorem 13.11 we conclude the following theorem. 
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Theorem 4.1. Let {p(9), G T} be a parametric family of states of a finite level 
quantum system in a Hilbert space H and let (£, if) be any unbiased estimator of a 
real-valued parametric function f based on a generalized measurement £ and a real 
scalar function ip on the set of values of the measurement. Suppose Fj, 1 ^ j ^ n 
are Fisher maps for {p{0), 9 G T}. Then 

Var ((£, <p)\0) > (X(f, F a ), A(/, F 2 ),..., X(f, F n )) 
l~(X(f,F 1 ),X(f,F 2 ),...,\(f,F n ))'(6) 

where A is the CRB tensor and I~ is the generalized inverse of the information 
matrix 

l n = ((l(F i ,F j ))), j,ie{l,2,..,n}. 



Proof. Immediate. ■ 

We shall briefly consider the case of estimating many parametric functions fi(6), 
1 ^ i ^ m. In order to estimate them it appears that several generalized mea- 
surements are to be made. Such measurements have to be made in succession. 
As directed in Section 2 we may treat them all as a single compound generalized 
measurement £ = (L,S). Let (£,<fi) be an unbiased estimator of /, for each i. 
Thus the measurement £ is carried out and if the outcome is s G S then (pi(s) is 
the estimate of fi{6). The probability for the outcome s is Tr p(9)L(sy L(s). Thus 
the covariance matrix of the different estimators is given by 

Cov(£,<^i,y9 2 , • • • ,¥m\0) 
ses 

i.j G {1,2,..., m}. (4.5) 

As in the discussion preceding Theorem 14. II we can construct the Naimark dilation 
{E(s),s G S} for the positive operator-valued distribution {L(sYL(s),s G 5"} in 
an enlarged Hilbert space and view the covariance matrix (14.51) as 



cov (x x ,x 2 ,...,x m \e 



for the observables X{, = (fi(s)E(s) with respect to the states p{0). This at once 

s 

leads us to the CRB matrix inequality 

Cov (£, ip u <p 2 ,..., ip m \9) > ((A^, Fj))) ( (1-(F P , F q ))) 
{(X(f h Fj)))' (0), 1 < i < m; j, p, q G {1, 2, . . . , n}. 

for any set {Fj, 1 ^ j ^ n} of Fisher maps, A being the CRB tensor, ((X n (Fj, Fj))) 
the Fisher information matrix with respect to {Fj, 1 ^ j ^ n} and the super index 
- denoting generalized inverse. 
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